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The regression depth of a hyperplane with respect to a set of n points in W 1 is the minimum 
qj ' number of points the hyperplane must pass through in a rotation to vertical. We generalize 

^"^ ■ hyperplane regression depth to &-fiats for any k between and d — 1 . The k = case gives the 

classical notion of center points. We prove that for any k and d, deep £-flats exist, that is, for 
any set of n points there always exists a £-flat with depth at least a constant fraction of n. As a 
consequence, we derive a linear-time (1 + ^-approximation algorithm for the deepest fiat. 



o. 

U ■ 1 Introduction 



Linear regression asks for an affine subspace (a flat) that fits a set of data points. The most fa- 
miliar case assumes d — I independent or explanatory variables and one dependent or response 
variable, and fits a hyperplane to explain the dependent variable as a linear function of the inde- 
pendent variables. Quite often, however, there may be more than one dependent variable, and the 
multivariate regression problem requires fitting a lower-dimensional flat to the data points, perhaps 
^vq ■ even a succession of flats of increasing dimensions. Multivariate least-squares regression is easily 

solved by treating each dependent variable separately, but this is not correct for other common forms 
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of regression such as least absolute deviation fl8n or least median of squares [12] 



Rousseeuw and Hubert Q14| ] introduced the notion of regression depth as a robust criterion for 
linear regression. The regression depth of a hyperplane H fitting a set of n points is the minimum 
number of points whose removal makes H into a nonfit. A nonfit is a hyperplane that can be rotated 
to vertical (that is, parallel to the dependent variable's axis) without passing through any points. 
The intuition behind this definition is that a vertical hyperplane posits no relationship between the 
dependent and independent variables, and hence many points should have to be invalidated in order 
to make a good regression hyperplane combinatorially equivalent to a vertical hyperplane. Since 
this definition does not make use of the size of the residuals, but only uses their signs, it is robust in 
the face of skewed or heteroskedastic (data-dependent) error models. Regression depth also has a 
number of other nice properties including invariance under affine transformations, and a connection 
to the classical notions of center points and data depth. 

This paper generalizes regression depth to the case of more than one dependent variable, that is, 
to fitting a k-Hat to points in R rf . This generalization is not obvious: for example, consider fitting a 
line to points in R 3 . Generic lines can be rotated wherever one likes without passing through data 
points, so how are we to distinguish one line from another? 
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We start by reviewing previous work (Section and stating our basic definitions (Section 
We then provide a lemma that may be of independent interest, on finding a family of large subsets of 
any point set such that the family has no hyperplane transversal (Section ||). We prove the existence 
of deep &-flats for any k (Section ||), and give tight bounds on depths of lines in M? (Section |6|). 
We conclude by discussing related generalizations of Tverberg's theorem (Section describing 
possible connections between fc-flats and (d — k — l)-flats (Section |8|), and outlining the algorithmic 
implications of our existence proof (Section ^). Along with the results proven in each section, we 
list open problems for further research. 



2 Previous Work 



Regression depth was introduced by Rousseeuw and Hubert [14] as a combinatorial measure of 
the quality of fit of a regression hyperplane. An older notion, variously called data depth, location 
depth, halfspace depth, or Tukey depth, similarly measures the quality of fit of a single-point esti- 
mator. It has long been known that there exists a point of location depth at least \n/(d + 1)] (a 
center point). Rousseeuw and Hubert provided a construction called the catline [1] for computing 
a regression line for a planar point set with depth at least fn/3] , and conjectured [ 1J] that in higher 
dimensions as well there should always exist a regression hyperplane of depth \n/(d + 1)] . Steiger 
and Wenger Jl6| ] proved that a deep regression hyperplane always exists, but with a much smaller 
fraction than \l/(d + 1)]. Amenta, Bern, Eppstein, and Teng [Q] solved the conjecture using an 
argument based on Brouwer's fixed-point theorem and a close connection between regression depth 
and center points. 



On the algorithmic front, Rousseeuw and Struyf [15] gave algorithms for testing the regression 
depth of a hyperplane. Their time bounds are exponential in the dimension, unsurprising since the 
problem is NP-complete for unbounded dimension ph. For the planar case, Van Kreveld, Mitchell, 
Rousseeuw, Shark, Snoeyink, and Speckmann gave an 0(n log 2 n) algorithm for computing a deep- 
est line [|[]. Langerman and Steiger [f7|] later improved this to an optimal 0(n\ogn) time bound. 



3 Definitions 

Although regression is naturally an affine rather than projective concept, our constructions and 
definitions live most gracefully in projective space. We view ^-dimensional real projective space as 
a renaming of objects in (d + 1) -dimensional affine space. (Affine space is the familiar Euclidean 
space, only we have not specified a distance metric.) A k-flat, for 1 < k < d, through the origin of 
(d + 1) -dimensional affine space is a projective (k — I) -flat. In particular a line through the origin is 
a projective point and a plane through the origin is a projective line. A projective line segment is the 
portion of a projective line between two projective points, that is, a pair of opposite planar wedges 
with vertex at the origin. 

We can embed affine <i-space into projective space as a hyperplane that misses the origin. There 
is a unique line through any point of this hyperplane and the origin, and hence each point of affine 
space corresponds to a unique projective point. There is, however, one projective hyperplane, and 
many projective &-flats for k < d — 1, without corresponding affine flats; these are the projective 
&-flats parallel to the affine space. We say that these flats are at infinity. 

Each projective point p has a dual projective hyperplane D(p), namely the hyperplane orthog- 
onal to p at the origin in {d + 1) -dimensional affine space. Similarly a projective fc-flat dualizes 
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to its orthogonal (d — &)-fiat. Notice that in projective space, unlike in affine space, there are no 
exceptional cases: each &-flat is the dual of a (d — &)-flat. 

Now let X be a set of points in <f-dimensional projective space. (From now on we shall just 
say "point", "line", etc. rather than "projective point", "projective line", when there is no risk of 
confusion.) We now propose a key definition: a distance between flats with respect to the points 
in X. The definition is more intuitive in the dual formulation than in the primal, but we give both 
below for completeness. Let D(F) denote the flat that is dual to flat F and let D(X) denote the set 
of hyperplanes that are dual to points of X. A double wedge is the (closed) region between two 
projective hyperplanes. 

Definition 1 The crossing distance between two flats F and G with respect to X is the minimum 
number of hyperplanes ofD{X) intersected by a (closed) projective line segment with one endpoint 
on D(F) and the other on D(G). In the primal formulation, the crossing distance between F and G 
is the minimum number of points ofX in a double wedge that contains F in one bounding hyperplane 
and G in the other. 

We now turn our attention to linear regression and for ease of understanding, we return tem- 
porarily to J-dimensional affine space. Assume that we designate k dimensions as independent 
variables and d — k as dependent variables. Let / denote the linear subspace spanned by the in- 
dependent dimensions. We call a &-flat vertical if its projection onto / is not full-dimensional, that 
is, if its projection is not all of /. For example, let k = 1 and d = 3 and think of the x-axis as 
representing the independent variable; then any line contained in a vertical plane (that is, parallel to 
the yz-plane) is vertical. 

In projective space, a &-fiat is vertical if and only if it contains a point in a particular (d — k — 1)- 
flat at infinity, which we call the (d — k — \)-flat at vertical infinity and denote by V^-k-i- 

Definition 2 The regression depth of a k-flat F is its crossing distance from V^-k-i- Equivalently, 
the regression depth of F is the minimum number of points whose removal makes F into a nonfit, 
where a nonfit is a k-flat with crossing distance zero from Va-k-i- 

Any &-flat at infinity meets V^^-l an d therefore has depth zero. Therefore, any method for 
selecting a fc-flat of nonzero regression depth will automatically choose a fc-flat coming from the 
original affine space, rather than one that exists only in the projective space used for our definitions. 

Note that, unlike the case for ordinary least squares, there does not seem to be any way of 
solving &-fiat regression separately for each dependent variable. Even for the problem of finding a 
line in M?, combining the solution to two planar regression lines may result in a nonfit. 

4 Nontransversal Families 

In order to prove that deep fc-flats exist, we need some combinatorial lemmas on large subsets of 
points without a hyperplane transversal. 

Definition 3 Let S be a set of points. Then we say that a hyperplane H cuts S if both of the two open 
half spaces bounded by H contain at least one point ofS. We say that a family of sets is transversal 
if there is a hyperplane that cuts all sets in the family. 
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Figure 1: Construction of three non-transversal sets of n/6 points in M. 2 : (a) catline, formed by 
partitioning the point set vertically into equal thirds, and making a ham sandwich cut of the leftmost 
and rightmost 2n/3 points; (b) subdivision by three coincident lines into equal sixths. 

Lemma 1 ( [§,[l9|]) Let d be a constant, and assume we are given a set of n points in ~R d and a 
parameter p. Then we can partition the points into p subsets, with at most Injp points in each 
subset, such that any hyperplane cuts o(p) of the subsets. 

Lemma 2 ( [|lj]) Let p > q > d be constants. Then there is a constant C(p, q, d) with the following 
property: If J- is any family of point sets in M. d , such that any p-tuple of sets in T contains a 
transversal subfamily of q sets, then T can be partitioned into C(p, q, d) transversal subfamilies. 

Theorem 1 Let d be a constant. Then there is a constant P(d) with the following property: For any 
set S of n points in M. d , we can find a non-transversal family of d + 1 subsets of S, such that each 
subset in the family contains at least \n/P(d)~\ points ofS. 

Proof: Choose p to be a multiple of three, sufficiently large that the o(p) bound of Lemma [j] is 
strictly smaller than p/(3C(d + 1, d + l,d)), and let P(d) = 2p. By Lemma [T], partition S into p 
subsets of at most 2n/p points, such that any hyperplane cuts few subsets. 

Let T be the family consisting of the largest p/3 subsets in the partition. If the smallest member 
of T contains m points, then the total size of all the members of the partition would have to be at 
most (p/3) ■ Injp + (2p/3) ■ m = 2n/3 + 2pm/3, but this total size is just n, so m > n/(2p) and 
each member of T contains at least n/P(d) points. 

If each (d + 1) -tuple of sets in T were transversal, we could apply Lemma || and partition T 
into C(p, q, d) transversal subfamilies, one of which would have to contain at least \T\/C{p, q, d) = 
p/(3C(d + 1, d + l,d)) subsets. But this violates the o[p) bound of Lemma [I], so T must contain a 
non-transversal (d + 1) -tuple. This tuple fulfills the conditions of the statement of the lemma. ■ 

Clearly, P(l) =2 since the median partitions any set of points on a line into two nontransversal 
subsets. Figure [j] depicts two different constructions showing that P(2) < 6. Although the bound 
of six is tight for these two constructions (as can be seen by the example of points equally spaced 
on a circle) we do not know whether there might be a different construction that achieves a better 
bound; the best lower bound we have found is the following: 

Theorem 2 P(2) > ^—r-r w 4.622. 

2 sin 3 
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Figure 2: (a) Four great circles subdivide a sphere in the pattern of a cuboctahedron. (b) Four pairs 
of opposite points, connected by great circle arcs in the pattern of a cuboid. 

Proof: We form a distribution on the plane by centrally projecting the uniform distribution on a 
sphere. We show that any nontransversal triple for this distribution must have a set with measure 
at most 1 /4.622 times the total measure. The same bound then holds in the limit for discrete point 
sets formed by taking e-approximations of this distribution. 

Let Si, i € {1, 2, 3} denote the three nontransversal subsets of the plane maximizing the min- 
imum measure of any 5,. Without loss of generality, each Sj is convex. Consider the three lines 
tangent to two of the S,, and separating them from the third set (such a line must exist since the 
sets are nontransversal). These lines form an arrangement with seven (possibly degenerate) faces: a 
triangle adjacent on its edges to three three-sided infinite cells, and on its vertices to three two-sided 
infinite cells. The sets Sj coincide with the three-sided infinite cells: any set properly contained in 
such a cell could be extended to the whole cell without violating the nontransversality condition, 
and if they instead coincided with the two-sided cells we could shrink the arrangement's central 
triangle while increasing the sizes of all three Sj. The two arrangements in Figure |l| can both be 
viewed as such three-line arrangements, degenerate in different ways. 

Each line in the plane lifts by central projection to a great circle on the sphere. Consider the great 
circles formed by lifting four lines: the three lines considered above and the line at infinity. Any 
arrangement of four circles on the sphere cuts the sphere in the pattern of a (possibly degenerate) 
cuboctahedron (Figure ^|(a)). The three-sided infinite cells in the plane lift to quadrilateral faces of 
this cuboctahedron. Note that the area of a spherical quadrilateral is the sum of its internal angles, 
minus 27r. 

Form the dual of the arrangement by treating each great circle as an "equator" and placing a 
pair of points at the corresponding two poles. The geodesies between these points have the pattern 
of a cuboid (Figure ^b)) such that the length of each geodesic is an angle complementary to one 
of the cuboctahedron quadrilaterals' internal angles. Thus, the cuboid minimizing the maximum 
quadrilateral perimeter corresponds to the cuboctahedron maximizing the minimum quadrilateral 
area. But any spherical cuboid has at least one face covering at least one-sixth of the sphere, and the 
minimum perimeter for such a quadrilateral is achieved when the quadrilateral is a square. There- 
fore, the regular cube minimizes the maximum perimeter and the regular cuboctahedron maximizes 
the minimum area. The ratio of a regular cuboctahedron's square face area to the area of a full 
hemisphere is the value given, ri — r ~ 4.622. ■ 

2 sin 1/3 
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We also do not know tight bounds on P(d) for d > 3. The proof of Theorem |l| (using the 
best known bounds in Lemma [j] ^) leads to upper bounds of the form 0(C(d + l,d + l,d)~ d ). 
We may be able to improve this somewhat, to 0(C(d + l,d,d — l) l ~ d ), by a more complicated 
construction: Project the points arbitrarily onto a (d — 1) -dimensional subspace, find a partition in 
the subspace, and use Lemma || to find a family of d + 1 subsets such that no subfamily of d subsets 
has a transversal. As in the catline construction [|J], group these subsets into d pairs, and form a ham 
sandwich cut in M. d of these pairs, in such a way that this cut partitions each subset of the family 
in the same proportion a : b, and such that the half-subsets of size a are above or below the ham 
sandwich cut accordingly as the members of the family of d + 1 subsets are on one or the other side 
of a Radon partition of those subsets in R rf_1 . Without loss of generality, a > b; then choose the 
d + 1 subsets required by Lemma [j] to be the ones of size a. 

Open Problem 1 Prove tighter bounds on P{d) for d > 2. 



5 Deep &-Flats 



It is previously known that deep &-flats exist for k = [10] and k = d — 1 16]. In this section we 
show that such flats exist for all other values of k. 

We first need one more result, a common generalization of centerpoints and the ham sandwich 
theorem: 



Lemma 3 (The Center Transversal Theorem [g, 20 1) Let k + 1 point sets be given in Mr, each 
containing at least m points, where < k < d. Then there exists a k-flat F such that any closed 
half space containing F contains at least \m/(d — k + 1)] points from each set. 

The weaker bound \m/(d+ 1)] can be proven simply by choosing a flat through the centerpoint 
of each subset. 



Theorem 3 Let d and < k < d be constants. Then there is a constant R(d, k) such that for any 
set of n points with k independent and d — k dependent degrees of freedom, there exists a k-flat of 
regression depth at least \n/R(d,k)~\. 

Proof: Project the point set onto the subspace spanned by the k independent directions, in such 
a way that the inverse image of each point in the projection is a (d — &)-flat containing Vd-k-\- 
By Theorem ^J, we can find a family of k + 1 subsets of the data points, each with n/P(k) points, 
such that the ^-dimensional projection of this family has no transversal. We then let F be the k-t\at 
determined by applying Lemma |3| to this family of subsets. 

Then consider any double wedge bounded by a hyperplane containing F and a hyperplane con- 
taining V c i-k-\- The vertical boundary of this double wedge projects to a hyperplane in R^, so it 
must miss one of the k+ 1 subsets in the family. Within this missed subset the double wedge appears 
to be simply a halfspace through F. By Lemma ||, the double wedge must therefore contain at least 
n/((d -k + l)P{k)) points. Thus if let R(d, k) = (d - k + the theorem is satisfied. ■ 

For k = or k = d— lwe know that R(d, k) = d + 1 [g]. However exact values are not known 
for intermediate values of k. 

Open Problem 2 Prove tighter bounds on R(d, k)for 1 < k < d — 2. 
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The following conjecture would follow from the assumption that R(d, k) is a linear function 
of d for fixed k (as the 0(d) bound of Theorem || makes plausible), since R(k, k) = 1 and R(k + 
l,k) = k + 2. It also matches the known results R(d, 0) = R(d, d — 1) = d + 1 and the bounds 
1) < 2d — 1 and /?(3, 1) = 5 of the following section. 

Conjecture 1 = (A + l)(d - k) + 1. 

6 Tighter Bounds for Lines 

The proof of Theorem |5| shows that R(d, 1) < 2<f ; this can be slightly improved using a technique 
of overlapping sets borrowed from the catline construction. 

Theorem 4 R(d, 1) < 2d - 1. 

Proof: The proof of Theorem || can be viewed as projecting the points onto a horizontal line, 
dividing the line into two rays at the median of the points, and applying the center transversal 
theorem to the two sets of n/2 points contained in each ray. Instead, we project the points onto the 
horizontal line as before, but partition this line into three pieces: two rays containing (d—\)n/(2d — 
1) points each, and a line segment in the middle containing the remaining n/(2d — 1) points. We 
then apply the center transversal theorem to two sets S\ and S2 of dn / (2d — 1 ) points each, formed 
by the points having a projection in the union of the middle segment and one of the two rays. This 
theorem finds a line such that no halfspace containing it has fewer than n / (2d — 1 ) points in either 
of the sets 5/. We claim that this line has regression depth at least n/(2d — 1). 

To prove this, consider any double wedge in which one hyperplane boundary contains the re- 
gression line, and the other hyperplane boundary is vertical. The vertical hyperplane then intersects 
the horizontal projection line in a single point. If this intersection point is in one of the two rays, 
then the vertical hyperplane misses the set S, formed by the other ray and the middle segment. In 
this case, the double wedge contains the same subset of Sj as a halfspace bounded by the double 
wedge's other bounding plane, and so contains at least n/(2d — 1) points of Si 

In the remaining case, the vertical boundary of the double wedge intersects the horizontal pro- 
jection line in its middle segment. Within each of the two sets to which we applied the center 
transversal theorem, the double wedge differs from a halfspace (bounded by the same nonvertical 
plane) only within the middle set. But any hyperplane bounds two different halfspaces, and the 
halfspace approximating the double wedge in Si is opposite the halfspace approximating the double 
wedge in S2. Therefore, if we let X,- denote the set of points in the halfspace but not in the double 
wedge, then X\ and X 2 are disjoint subsets of the middle n/(2d—\) points. The number of points in 
the double wedge within one set S, must be at least n/(2d — 1) — |X;|, so the total number of points 
in the double wedge is at least 2nj (2 d - 1) - |Xi UX 2 | < 2n/(2d- 1) - n/(2d - 1) = n/(2d- 1). 

Thus in all cases a double wedge bounded by a hyperplane through the regression line and by 
a vertical hyperplane contains at least n/(2d — 1) points, showing that the line has depth at least 
n/(2d - 1). ■ 

As evidence that this 2d — 1 bound may be tight, we present a matching lower bound for d = 3. 
Theorem 5 R(3, 1) = 5. 
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Figure 3: Three-dimensional plane arrangement showing R(3, 1) > 5: (a) cross-section for x = 1; 
(b) cross-section for x = — 1 . 

Proof: We have already proven that R(3, 1) < 5, so we need only show that R(3, 1) > 5. We work 
in the dual space, and construct an arrangement of n planes in M?, for n any multiple of 5, such that 
any line has depth at most n/5. 

Our arrangement consists of five groups of nearly parallel and closely spaced planes, which 
we label A\, Ai, B\, B^, and C. Rather than describe the whole arrangement, we describe the line 
arrangements in the planar cross-sections at x = 1 and x = — 1. Recall that the depth of a line in 
the three-dimensional arrangement is the minimum number of planes crossed by any vertical ray 
starting on the line. Limiting attention to rays contained in the two cross-sections (and hence to the 
planar depth of the two points where the given line intersects these cross-sections) gives an upper 
bound on the depth of the line, and so a lower bound on R(3, 1). 

In the first cross-section, we place the groups of lines as shown in Figure ||(a), with the region 
where A\ and A 2 cross contained inside the triangle formed by the other three groups. Moreover, 
A\ and A2 do not cross side B2 of the triangle, instead crossing group B2 at points outside the 
triangle. The points where members of A\ intersect each other are positioned on segment CA\ - 
A\Ai- Similarly, the crossings within A2 are situated on segment A\Ai - A^Bi. The crossings 
within B\, B2, and C are situated along the corresponding sides of the triangle formed by these three 
groups. 

In the cross-section formed as described above, points from most cells in the arrangement can 
reach infinity while crossing only one group, and so have depth at most n/5. It is only within the 
segments CA\ - A1A2 and A\A2 - A2B1 that a point can have higher depth. The arrangement is 
qualitatively similar for nearby cross-sections x = 1 ± e. Therefore, any deep line in M? must be 
either nearly parallel to A\ and not near any B,, or nearly parallel to A2 and not near #2- 

In the second cross-section (Figure ^(b)), the groups A/ and Bj reverse roles: the point where B\ 
and B2 cross is contained in the triangle determined by the other three groups, and the other details 
of the arrangement are situated in a corresponding manner. Therefore, any deep line would have to 
be either nearly parallel to B\ and not near any A,, or nearly parallel to B2 and not near A2. 

There is no difficulty forming the two cross-sections described above from a single plane ar- 
rangement, since (as shown in Figures 3(a) and (b)) the slopes of the lines within each group can 
remain the same in each cross-section. But the requirements imposed on a deep line by these two 
cross-sections are contradictory, therefore no line can have depth greater than n/5 in this arrange- 
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ment. ■ 

We believe that a similar proof can be used to prove a more general 2d — 1 lower bound on 
R(d, 1) in any dimension, matching the upper bound in Theorem ^[ form an arrangement with 
hyperplane groups A,-, and C, so that in one cross-section the A, meet in a vertex contained in a 
simplex formed by the other groups, and in the other cross-section the groups A, and B, exchange 
roles. However we have not worked out the details of where to place the intersections within groups, 
how to choose hyperplane angles such that the inner groups miss a face of the outer simplex in both 
cross-sections, or which cells of the resulting arrangements can have high depth. 



7 Generalizations of Tverberg's Theorem 

A Tverberg partition of a set of point sites is a partition of the sites into subsets, the convex hulls of 
which all have a common intersection. The Tverberg depth of a point t is the maximum cardinality 
of any Tverberg partition for which the common intersection contains t. Note that the Tverberg 
depth is a lower bound on the location depth. Tverberg's theorem [jl7, 18] is that there always exists 
apoint with Tverberg depth \n/(d+ 1)] (a Tverberg point); this result generalizes both the existence 
of center points (since any Tverberg point must be a center point) and Radon's theorem [|ll|] that 
any d + 2 points have a Tverberg partition into two subsets. 

Another way of expressing Tverberg's theorem is that for any point set we can find both a 
partition into \n/(d + 1)] subsets, and a point t, such that t has nonzero depth in each subset of the 
partition. Stated this way, there is a natural generalization to higher dimensional flats: 

Theorem 6 Let d and < k < d be constants. Then there is a constant T(d, k) such that for any 
set of n points with k independent and d — k dependent degrees of freedom, there exists a k-flat F 
and a partition of the points into \n/T(d, k)~\ subsets, such that F has nonzero regression depth in 
each subset. 

Proof: As in the proof of Theorem |3[ we project the point set onto the subspace spanned by the 
k independent directions, in such a way that the inverse image of each point in the projection is 
a (d — fc)-fiat containing Vd-k-\- By Theorem [ll we can find a family of k + 1 subsets 5,-, each 
with n/P{k) points, such that the ^-dimensional projection of this family has no transversal. We 
then find a Tverberg point and a Tverberg partition of each set S; into subsets Tn, for 1 < j < 
\n/(P(k)(d + 1))] . We let F be the fc-flat spanning these k + 1 Tverberg points. We form each set 
Tj in our Tverberg partition as the union U,T, ■ ,-. Some points of S may not belong to any set Tn, in 
which case they can be assigned arbitrarily to some set r, in the partition. 

Then consider any double wedge bounded by a hyperplane containing F and a hyperplane con- 
taining V,j-k-i- The vertical boundary of this double wedge projects to a hyperplane in ~R k , so it 
must miss one of the k + 1 subsets Sj. Within Sj the double wedge appears to be simply a half space 
through ?,. It therefore contains at least one point of each set 7y and a fortiori at least one point of 
each set 7). Thus if let R(d, k) = {a I + l)P(k) the theorem is satisfied. ■ 

We know that T(d, 0) = d + 1 by Tverberg's theorem, and the catline construction [Q] shows 
that T(2, 1) = 3. However even in the case k = d— 1 we do not know a tight bound; Rousseeuw and 
Hubert [|TJ,|^] conjectured that T(d, d — 1) = d + 1 but the best known bounds from our previous 
paper [g] are T(d, d - 1) < d(d + 1) and T(3, 2) < 6. 

Open Problem 3 Prove tighter bounds on T(d, k)for 1 < k < d — 1. 
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8 Connection Between &-Flats and (d — k — 1) -Flats 



There is a natural relation between finding a deep k-fiat and finding a deep (d — k — l)-flat: in both 
cases one wants to find a k-fiat and a (d — k — l)-flat that are far apart from each other, and the 
problems only differ in which of the two flats is fixed at vertical infinity, and which is to be found. 

In our previous paper [^] we exploited this connection in the following way, to show that 
R(d,d — 1) = d + 1. A centerpoint (corresponding to the value of R(d,0)) is just a point far 
from a given "hyperplane at infinity"; in projective cf-space, this hyperplane can be chosen arbi- 
trarily, resulting in different centerpoint locations. We first found an appropriate way to replace the 
input point set by a smooth measure, and modify the definition of a centerpoint, in such a way that 
we could show that the modified centerpoint location varied continuously and equivariantly, as a 
function of the position of the hyperplane at infinity. In oriented projective space, the set of hyper- 
plane locations and the set of centerpoint locations are both topological J-spheres, so we could have 
applied the Borsuk-Ulam theorem (in the form that every continuous equivariant function from the 
sphere to itself is surjective) to find a hyperplane in the inverse image of the point at vertical infinity; 
this hyperplane is the desired deep regression plane. Our actual proof used the Brouwer fixed point 
theorem in a similar way, avoiding the need to use the equivariance property. 

Conjecture [I] implies in more generality that R(d,k) = R(d, d — k — 1), and one would naturally 
hope for a similar proof of this equality. There are two obstacles to such a hope: First, we do not 
know how to modify the definition of a deep fc-flat in such a way as to choose a unique flat which 
varies continuously as a function of the location of the (d — k — l)-flat at infinity. A similar lack of 
a continuous version of Tverberg's theorem blocked our attempts to prove that T{d,d—\) = d+ 1. 
However, some of our constructions (for instance the bound 2(d+ 1) on R(d, 1) formed by vertically 
bisecting the points and choosing a line through the centerpoints of each half) can be made contin- 
uous using ideas from our previous paper. Second, and more importantly, the space Ff of oriented 
&-flats does not form a topological sphere, and there can be continuous equivariant non-surjective 
functions from this space to itself. Nevertheless there might be a way of using generalizations of 
the Borsuk-Ulam theorem [||] or a modification of our Brouwer fixed point argument to show that 
the deep k-fiat function must be surjective, perhaps using the additional property that a deep &-flat 
cannot be incident to Vd-k- 1 ■ 

Open Problem 4 Can there exist a continuous non-surjective ^-equivariant map c from F^_,_j 
to Ff such that any (d — k — \)-flat V and its image c(V) are never incident? 

Open Problem 5 Does R(d, k) = R(d, d — k— I) for 1 <k <d — 2? 

Open Problem 6 Does T(d,k) = T(d,d — k — I) for <k < d — \ ? 

9 Algorithmic Implications 

We now show how to use our proof that deep flats exist as part of an algorithm for finding an 
approximate deepest flat. We begin with an inefficient exact algorithm. 

Theorem 7 Let d and k be constants. Then we can find the deepest k-fiat for a collection ofn points 
in R d , in time n°( l \ 
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Proof: Let A be the arrangement of hyperplanes dual to the n given points. The distance from 
points in IR rf to V^-k-i is constant within each cell of A, and all such distances can be found in time 
0{n d ) by applying a breadth first search procedure to the arrangement. The depth of a &-flat F is 
just the minimum depth of any cell of A pierced by F. Any two flats that pierce the same set of cells 
of A have the same depth. 

The space of fc-flats forms a (k + \){d — k) -dimensional algebraic set F d , in which the flats 
touching any (d — k — 1) -dimensional cell of A form a subset of codimension one. The arrangement 
of these 0(n d ) subsets partitions Ff into 0(n d ^ k+1 ^ d ~ k ^ +€ ) cells, corresponding to collections of 
flats that all pierce the same set of cells. We can construct this arrangement, and walk from cell to 
cell maintaining a priority queue of the depths of the cells in A pierced by the flats in the current 
cell, in time o(n d( - k+l ^ d ^ +t ). ■ 

We now use standard geometric sampling techniques to combine this exact algorithm with our 
lower bound on depth, resulting in an asymptotically efficient approximation algorithm. 

Theorem 8 Let d, k, and 5 > be constants. Then we can find the a k-flat with depth within a 
(1 — 8) factor of the maximum, for a collection of n points in W 1 , in time 0(n). 

Proof: We first construct an e-approximation S of the points, for the range space consisting of 
double wedges with one vertical boundary, where e = 5/ (2R(d, k)). Then if a fiat F has depth D 
with respect to S, Dn/\S\ is within an additive en term of the true depth of F with respect to the 
original point set. S can be found with \S\ = 0(e~ 2 log e _1 ), in time 0(e), using standard geometric 
sampling algorithms. We then let F be the deepest fiat for S. 

Suppose the optimal flat F* for the original point set has depth cn. Then the depth of F* in 5, 
and therefore also the depth of F in S, must be at least (c — e)|5|. Therefore, the depth of F in the 
original point set must be at least (c — 2e)n. Since c > 1 /R(d, k), (c — 2e)n > (1 — S)cn. ■ 

Although our approximation algorithm takes only linear time, it is likely not practical due to its 
high constant factors. However, perhaps similar ideas can form the basis of a more practical random 
sampling based algorithm. 

Open Problem 7 Improve the time bounds on finding an exact deepest k-flat. Is it any easier to find 
a k-flat with depth at least n/R(d, k), that may not necessarily approximate the deepest flat? 
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